A model for multitalker speech perception.
نویسندگان
چکیده
A listener's ability to understand a target speaker in the presence of one or more simultaneous competing speakers is subject to two types of masking: energetic and informational. Energetic masking takes place when target and interfering signals overlap in time and frequency resulting in portions of target becoming inaudible. Informational masking occurs when the listener is unable to distinguish target and interference, while both are audible. A computational model of multitalker speech perception is presented to account for both types of masking. Human perception in the presence of energetic masking is modeled using a speech recognizer that treats the masked time-frequency units of target as missing data. The effects of informational masking are modeled as errors in target segregation by a speech separation system. On a systematic evaluation, the performance of the proposed model is in broad agreement with the results of a recent perceptual study.
منابع مشابه
Design Considerations for Improving the Effectiveness of Multitalker Speech Displays
Although many researchers have commented on the potential of audio display technology to improve intelligibility in multitalker speech communication tasks, no consensus has been reached on how to design an “optimal” multitalker speech display. This paper reviews a set of experiments that used a consistent procedure to evaluate the impact of six different parameters on overall intelligibility in...
متن کاملModeling the perception of multitalker speech
Listeners’ ability to understand a target speaker in the presence of one or more simultaneous competing speakers is subject to two types of masking: Energetic and informational. Energetic masking occurs when target and interfering signals overlap in time and frequency resulting in portions of target becoming inaudible. Informational masking occurs when the listener is unable to segregate the ta...
متن کاملSpatial and temporal modifications of multitalker speech can improve speech perception in older adults.
Speech perception in multitalker environments often requires listeners to divide attention among several concurrent talkers before focusing on one talker with pertinent information. Such attentionally demanding tasks are particularly difficult for older adults due both to age-related hearing loss (presbacusis) and general declines in attentional processing and associated cognitive abilities. Th...
متن کاملFactors That Influence Intelligibility in Multitalker Speech Displays
Although many researchers have commented on the potential of audio display technology to improve intelligibility in multitalker speech communication tasks, no consensus exists on how to design an “optimal” multitalker speech display. In this article, we review several experiments that have used a consistent procedure to evaluate the effect of four monaural parameters on overall intelligibility....
متن کاملMultitalker speech perception with ideal time-frequency segregation: effects of voice characteristics and number of talkers.
When a target voice is masked by an increasingly similar masker voice, increases in energetic masking are likely to occur due to increased spectro-temporal overlap in the competing speech waveforms. However, the impact of this increase may be obscured by informational masking effects related to the increased confusability of the target and masking utterances. In this study, the effects of targe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- The Journal of the Acoustical Society of America
دوره 124 5 شماره
صفحات -
تاریخ انتشار 2008